Picture for Sang Michael Xie

Sang Michael Xie

Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes

Add code
Jan 26, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Impact of Pretraining Word Co-occurrence on Compositional Generalization in Multimodal Models

Add code
Jul 10, 2025
Viaarxiv icon

Meta-Designing Quantum Experiments with Language Models

Add code
Jun 04, 2024
Figure 1 for Meta-Designing Quantum Experiments with Language Models
Figure 2 for Meta-Designing Quantum Experiments with Language Models
Figure 3 for Meta-Designing Quantum Experiments with Language Models
Figure 4 for Meta-Designing Quantum Experiments with Language Models
Viaarxiv icon

A Survey on Data Selection for Language Models

Add code
Mar 08, 2024
Viaarxiv icon

DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining

Add code
May 24, 2023
Viaarxiv icon

Reward Design with Language Models

Add code
Feb 27, 2023
Figure 1 for Reward Design with Language Models
Figure 2 for Reward Design with Language Models
Figure 3 for Reward Design with Language Models
Figure 4 for Reward Design with Language Models
Viaarxiv icon

Data Selection for Language Models via Importance Resampling

Add code
Feb 06, 2023
Figure 1 for Data Selection for Language Models via Importance Resampling
Figure 2 for Data Selection for Language Models via Importance Resampling
Figure 3 for Data Selection for Language Models via Importance Resampling
Figure 4 for Data Selection for Language Models via Importance Resampling
Viaarxiv icon

Holistic Evaluation of Language Models

Add code
Nov 16, 2022
Figure 1 for Holistic Evaluation of Language Models
Figure 2 for Holistic Evaluation of Language Models
Figure 3 for Holistic Evaluation of Language Models
Figure 4 for Holistic Evaluation of Language Models
Viaarxiv icon

Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models

Add code
Oct 25, 2022
Viaarxiv icon